C4 trees have a broader niche than their close C3 relatives

While C3and C4trees inhabit similar environments, C4photosynthesis in trees expands the ecological niche relative to that of C3tree species.


Introduction
It is widely accepted that the ecological niche of a given plant species is influenced by its photosynthetic efficiency (Black, 1971;Sage et al., 1999;Lundgren et al., 2015). Modifications to the photosynthetic apparatus can increase or decrease efficiency depending on environmental conditions. One such modification that increases carbon, water, and nitrogen use efficiencies is the C 4 photosynthetic pathway (Evans, 2013). This pathway largely eliminates the energetically costly process of photorespiration, which occurs when the carbon-fixing enzyme Rubisco catalyses the fixation of oxygen instead of CO 2 , resulting in the accumulation of toxic by-products that need to be recycled (Bräutigam and Gowik, 2016). Certain environmental conditions are known to increase the rate of photorespiration, including low CO 2 concentrations, warmth, bright light, aridity, and salinity (Chollet and Ogren, 1975;Ehleringer et al., 1991;Sage et al., 2018). As a result, species utilizing C 4 photosynthesis theoretically perform better than plants using the ancestral C 3 pathway in these environments (Pearcy and Ehleringer, 1984). This, at least in part, explains the well-reported differences in global distribution patterns for C 3 and C 4 species (reviewed in Ehleringer et al., 1997;Christin and Osborne, 2014).
Ecological sorting of photosynthetic types along temperature gradients is particularly apparent in grasses, where C 4 grasses are abundant at high temperatures and give way to C 3 grasses as temperature declines (Teeri and Stowe, 1976;Bremond et al., 2012;Pau et al., 2013). For eudicot herbaceous species, water availability seems to be the most important determinant of distribution, with C 4 forbs being favoured over their C 3 counterparts in areas with limited water supply (Stowe and Teeri, 1978;Pyankov et al., 2010). Furthermore, species using the CAM (Crassulacean acid metabolism) photosynthetic pathway have advantages over both C 3 and C 4 species with respect to their maximum potential water use efficiencies, and frequently dominate in the most arid environments (Orsenigo et al., 1997;Black and Osmond, 2003). However, epiphytic CAM species can also be abundant in tropical rainforests, which have very high levels of precipitation, as it is the epiphytic growth form which drives their water limitation (Whittaker, 1975;Quezada and Gianoli, 2011). The difference in the major environmental predictors of photosynthetic type distribution between C 3 and C 4 grasses and forbs, as well as the contrasting environments of CAM species highlights the importance of distinguishing growth forms when examining the effect of photosynthetic type on the ecological niche. However, the relationship between photosynthetic type and ecology has not yet been investigated in the tree growth form.
The effects of photosynthetic type on the ecological and geographical distributions of trees may be different from those of herbaceous species for three reasons. First, there are key differences between monocots and eudicots, the plant clades which contain all known C 4 species (Sage, 2016). More than half of all the >60 C 4 origins have occurred in eudicots, but these lineages account for less than a quarter of all known C 4 species (Sage, 2016). Quantum yield (defined as the rate of photosynthesis relative to photon absorption) is generally lower in the eudicots-which includes almost all true tree species-than in the monocots. This results in generally poorer shade tolerance among C 4 eudicots compared with monocots that could have implications for their distributions, and thus the relationship between photosynthetic type and ecological niche (Ehleringer et al., 1997). This poor shade tolerance may be augmented by a potentially reduced capacity for exploiting sunflecks in C 4 versus C 3 plants, although this response does not appear to be consistent or universal given that reduced sunfleck use efficiency is not observed in all species and growth forms of C 4 plants (Pearcy et al., 1985;Krall and Pearcy, 1993; discussed in Sage and McKown, 2006;Sage, 2014). Secondly, life history influences ecological niche (Pyankov et al., 2010;Liu et al., 2019). Pyankov et al. (2010) hypothesize that the abundance of C 4 annual species compared with C 4 perennials reflects the fact that C 4 photosynthesis confers the greatest fitness benefit over short periods of time. Therefore, tree species, which have long life spans, may not benefit as much from C 4 photosynthesis as short-lived herbaceous species. Furthermore, Liu et al. (2019) demonstrate that the variation in niche descriptors of C 3 and C 4 subtropical grasses was best explained by differences in life history: annual subtropical grasses (particularly C 4 annuals) tend to grow in regions with higher temperatures and lower, more seasonal precipitation compared with perennial grasses. Thirdly, growth form probably influences the ecological niche through functions such as water transport or light competition rather than photosynthetic type. Tall plant species such as trees typically exhibit wider water conduits than shorter plants, and these wider conduits are more vulnerable to embolism under drought or freezing (Olson et al., 2018). This means that the tree state may result in an intrinsically higher degree of vulnerability to drought, and this vulnerability would not necessarily be alleviated by the C 4 pathway. This problem may be exacerbated where the C 4 state is recently evolved (i.e. a tree evolves C 4 or the tree state evolves in a young C 4 lineage), as shown by a study of young C 4 grass lineages which demonstrates that leaf hydraulic conductance is increased due to the anatomy required by the C 4 system, despite the C 4 state reducing hydraulic demand, resulting in less negative turgor loss points in young compared with old C 4 lineages (Zhou et al., 2020, Preprint). As such, a tall C 4 plant such as a tree, particularly one belonging to a young C 4 lineage, would theoretically not be able to access more arid environments than their C 3 counterparts. This is reflected in the apparent height limitation of the only known C 4 trees (found in one of the older C 4 eudicot lineages), which are not observed to exceed 10 m in height (Young et al., 2020). Despite these clear differences across life forms, histories, and growth patterning, the ecological sorting of photosynthetic diversity in trees remains largely unexplored.
Evolutionary history is another important factor determining the ecological sorting of photosynthetic diversity. Comparisons between distantly related grass species have often revealed more pronounced differences between the ecological distributions of C 3 and C 4 species than studies comparing photosynthetic diversity between closely related species or within a single species (Pau et al., 2013;Lundgren et al., 2015). The broad differences identified between C 3 and C 4 species probably reflect the fact that distribution patterns are not solely determined by the acquisition of C 4 photosynthesis and are affected by the ecology and functional traits of the ancestral C 3 lineages (Edwards and Still, 2008;Edwards and Smith, 2010). With time, niche specialization may then occur after the initial emergence of C 4 physiology, depending on differences in functional traits, resulting in some C 4 taxa becoming specialized to environments different from those of their C 3 ancestors and generating an apparent niche shift. This means that comparisons between distantly related species of differing photosynthetic type may reveal pronounced differences between C 4 and C 3 species, but these differences may only be partly driven by photosynthetic type. Furthermore, studies that look at intraspecific photosynthetic diversity offer a closer look at the direct consequences of C 4 physiology. Indeed, a study on the intraspecific photosynthetic diversity of the grass Alloteropsis semialata-the only species with both C 4 and non-C 4 genotypes-showed that C 4 photosynthesis actually broadens the ecological niche, rather than shifting it away from that of ancestral C 3 lineages (Lundgren et al., 2015). With this in mind, our study set out to determine the ecological sorting of closely related C 3 and C 4 trees, as well as any influence that evolutionary history may have had on their ecological distributions.
Tree photosynthetic diversity only exists within Euphorbiaceae, a morphologically diverse plant family with C 3 , C 4 , and CAM tree species (Table 1; Webster et al., 1975;Webster, 1994;Horn et al., 2012). The Chamaesyce clade of Euphorbia (Euphorbiaceae) is the largest single C 4 lineage among the eudicots, with 350 C 4 species and containing the only true C 4 trees, defined here as tall, perennial, woody life forms with secondary growth and C 4 leaves (Pearcy and Troughton, 1975;Yang and Berry, 2011;Young et al., 2020). These C 4 trees are endemic to the Hawaiian Islands, where they diversified from a likely herbaceous ancestor that arrived on the islands ~5 million years ago (Yang et al., 2018). This diversification event yielded five C 4 trees in present-day Hawaii, which, when combined with the 17 C 3 and three CAM Euphorbiaceae tree species currently on these islands (Table 1), makes the Hawaiian Islands the global centre of photosynthetic diversity in trees. The environment on the islands is highly heterogenous, and trees in Euphorbiaceae occupy a range of environments from bright, open scrubland to mesic forest where they experience differing temperatures, precipitation levels, and light availability (Table 1; Sporck, 2011;Sage and Sultmanis, 2016). Hawaiian trees in Euphorbiaceae therefore provide a unique opportunity to compare the ecological niches of tree species with diverse photosynthetic backgrounds from a similar geographic region.
Here, we combine geographical occurrence and environmental datasets in a phylogenetic framework to examine the phylogeography of photosynthetic diversity in trees in order to elucidate the geographical and environmental factors that are permissive for photosynthetic innovation in trees. Based on our current knowledge of the sorting of photosynthetic diversity among herbaceous species, we hypothesize that C 4 trees occur in drier, hotter, and more open environments than their C 3 counterparts (Table 2), with the differences in water availability acting as the most important determinant of distribution as for herbaceous eudicots. However, as discussed above, these environmental differences may be less pronounced than those for herbaceous species given the theoretically reduced benefits of C 4 photosynthesis over long life spans and in the tree growth form. With only three CAM tree species which occur in small numbers on the Hawaiian Islands, we were unable to perform robust analyses on the phylogeography of this photosynthetic type but instead report on its distribution in relation to C 3 and C 4 trees in our study.

Geographic distribution data for Euphorbiaceae species
Occurrence data for all species in Euphorbiaceae on the Hawaiian Islands were downloaded from the Global Biodiversity Information Facility (https://www.gbif.org, https://doi.org/10.15468/dl.bc6bus) and the Botanical Information and Ecology Network (BIEN; https://bien.nceas. ucsb.edu/bien/). Data were imported into R using the readr and BIEN packages in R, respectively (Maitner et al., 2018;Wickham et al., 2018;R Core Team, 2020). These combined datasets yielded 1509 occurrence records for 78 species.
Occurrence data were then cleaned in R to remove any records that potentially contained inaccurate or unreliable information. First, all latitude and longitude data were rounded to two decimal places, and records with incomplete coordinate, species, or country data were removed. Taxa names were updated to currently accepted names where appropriate. Secondly, any records that were identified by GBIF as having at least one of the issues listed in Supplementary Table S1 were removed. Thirdly, data were processed using the CoordinateCleaner package in R to flag records with coordinates corresponding to country capitals or centroids, the GBIF headquarters, or another biodiversity institution; records with equal, zero, or invalid values for latitude and longitude; and records with coordinates that were not on land (Zizka et al., 2019). A total of 277 flagged records (summarized in Supplementary Fig. S1) were removed. Finally, duplicate records and records associated with botanic gardens, private gardens, and arboretums were removed. Upon completion of these cleaning steps, 690 occurrence records for 52 species remained ( Supplementary Fig. S2).
The remaining records were then designated as herb, shrub, or tree based on life form information in the World Checklist of Selected Plant Families (WCSP, 2021; Table 1). For Hawaiian Euphorbia, Yang et al. (2018) was used to supplement the life form data from the WCSP. Occurrence records were also designated as having C 3 , C 4 , C 2 , or CAM photosynthetic type based on the information available in the literature (Table 1; Supplementary Fig. S3). All C 4 Hawaiian species were identified using Yang et al. (2018) and Yang and Berry (2011). Five CAM species were identified to occur on the Hawaiian Islands, including three CAM tree species, Jatropha curcas, Euphorbia lactea, and Euphorbia tirucalli (Mies et al., 1996;Mason et al., 2015;Winter and Holtum, 2015). All other records, including those whose photosynthetic type could not be found in the literature, were designated as C 3 (Table 1). Where available from the literature, δ 13 C stable isotope values are recorded in Table 1 to support these photosynthetic type designations, with values greater (i.e. less negative) than -15 per mille indicating likely C 4 biochemistry.

Environmental factors
Biogeographical parameters hypothesized to be associated with the sorting of photosynthetic types were selected for our analyses, including geographical, climatic, and ecological factors (Table 2). Elevation data for occurrence records in Hawaii were obtained from the Shuttle Radar Topography Mission (STRM) digital elevation model (DEM) at 1 arcsecond (~30 m) spatial resolution via the rgbif package in R (Farr et al., 2007;Chamberlain et al., 2022). Environmental and ecological data were obtained at ~250 m resolution from the Online Rainfall Atlas of Hawaii and Climate of Hawaii (Giambelluca et al., 2013(Giambelluca et al., , 2014. Monthly precipitation and temperature data were processed using the bioclim2 function from the climates package in R to recreate the standard 19 bioclimatic variables (VanDerWal et al., 2014). Climatic water deficit (CWD) was calculated as potential evapotranspiration minus actual evapotranspiration (Stephenson, 1998). C 3 and C 4 tree data were plotted onto the Whittaker biomes using the plotbiomes package in R (Stefan and Levin, 2019). Soil data for the Hawaiian Islands were obtained from the Hawaii Soil Atlas (http://gis.ctahr.hawaii.edu/SoilAtlas) at ~670 m spatial resolution  Life form is obtained from the Kew World Checklist of Selected Plant Families (WCSP) and species designated as one or a combination of phanerophyte (phan, small and large trees), nanophanerophyte (nanophan, shrub), succulent (succ), chamaephyte (cham, woody or herbaceous perennial), therophyte (ther, annual), or helophyte (hel, herbaceous species with roots in water/saturated soil). These life form designations were used in conjunction with information from the citations to group species into trees, shrubs, and herbs.
c Environmental data for selected tree and shrub species were recorded from herbarium datasheets available online at GBIF. and imported into ArcGIS Pro (https://www.esri.com/en-us/arcgis/ products/arcgis-pro/overview). Data were obtained for the following soil variables: fertility (including soil class and cation exchange capacity), order, organic matter content, pH, shrink-swell potential (soil stability), and water permeability. Soil data for coordinates of interest were then extracted via the Spatial Join function from the Analysis toolbox.

Phylogeny
A dated phylogenetic tree for Euphorbiaceae was generated using previously published data. Three plastid regions (trnK-matK, rbcL, and trnL-trnF) were targeted as they were the best represented for the required taxa. Initially, the trnK-matK and rbcL dataset of Tokuoka (2007) was used, adding additional taxa and trnL-trnF sequence information where available. Each gene was aligned with MAFFT v.7.017 (Katoh et al., 2002) and concatenated into a single alignment. The final alignment was 4507 bp long and included 112 species (see Supplementary Table S2  . Three different analyses were run with 100 000 000 generations, sampling a tree every 1000 generations, a Yule speciation process, a relaxed log-normal clock, and the GTR+G model. The convergence of all three runs was verified using Tracer v. 1.6.0 (Drummond and Rambaut, 2007), after which a burn-in period of 10% was set. All trees were concatenated, and a maximum credibility tree inferred mapping median ages onto the nodes.

Statistical analyses
Statistical analyses were conducted in R version 4.0.4 (R Core Team, 2020). CAM species were excluded from the analyses owing to the small sample size (0 herb, 4 shrub, and 15 tree occurrence data points). Analyses were conducted primarily on data for C 3 and C 4 trees. Data for all C 3 and C 4 life forms, including herb and shrub occurrence data, were analysed separately from the tree data (Supplementary Figs S5,S6). Soil data and environmental data were analysed separately from each other due to soil data only being available for a subset (61%) of the occurrence data points ( Supplementary Fig. S7).
To explain any ecological variation between trees of C 3 and C 4 photosynthetic types, principal component analyses (PCAs) were conducted using the FactoMineR package (Le et al., 2008). Predictor variables that were hypothesized to influence the sorting of photosynthetic types were chosen for the analysis (Table 2), including minimum temperature of the coldest month, temperature of the wettest quarter (proxy for growing season temperature), precipitation seasonality, precipitation of the driest month, CWD, solar radiation, and vegetation cover (i.e. to indicate shaded habitats). Leave-one-out cross-validation was used to estimate the number of dimensions for the PCAs. All variables were scaled to unit variance before fitting. The first PCA was performed on 268 occurrence points representing 22 tree species, comprising 17 C 3 species (n=202) and five C 4 species (n=66). The 95% confidence ellipses were calculated for each group.
A second PCA was conducted on environmental data for C 3 and C 4 herbaceous (n=21, n=178, respectively), shrub (n=119, n=47), and tree (n=202, n=66) occurrence points to examine the ecological differences between C 3 and C 4 individuals of all life forms ( Supplementary Fig. S5). Data for woody species (i.e. trees and shrubs) were considered together in order to examine the ecological differences between woody and nonwoody species. The continuous soil variables from the soil dataset were included in a third PCA. These included cation exchange capacity (CEC), organic matter content (OM), pH, shrink-swell potential (soil stability), and water permeability (K sat ). Soil data for C 3 and C 4 tree (n=134, n=29) occurrence points were included in this analysis ( Supplementary Fig. S7). In all cases, all variables were scaled to unit variance before fitting.
To identify potential correlation between environmental and geographical distances (i.e. whether C 3 and C 4 trees have expanded their environmental range as they have expanded their geographical range across the Hawaiian Islands), Mantel permutation tests were conducted using the ape package (Paradis and Schliep, 2019). Geographic distances between points were extracted and assembled into a matrix using the earth. dist function in the fossil package (Matthew, 2011), and then compared with environmental distances (Euclidian distance in the space formed by the first three axes of the PCA) to test for statistical associations between matrices of environmental and geographical data. Linear regression analysis was carried out by fitting a linear model to each of the C 3 and C 4 datasets, with the slope of this regression representing the environmental change per unit of geographical expansion.  (2010) Temperature of the Wettest Quarter (Growing Season) C 4 trees are found in areas with higher growing season temperatures than C 3 trees Teeri and Stowe (1976), Vogel et al. (1986) Precipitation of Driest Month C 4 trees are found in areas with lower minimum monthly precipitation levels than C 3 trees Pyankov et al. (2010) Precipitation Seasonality C 4 trees are found in areas with more seasonal precipitation than C 3 trees Edwards and Smith (2010) Solar Radiation C 4 trees are found in areas with higher solar radiation than C 3 trees Ehleringer (1978) Climatic Water Deficit C 4 trees are found in areas with higher climatic water deficits than C 3 trees Witwickiet al. (2016) Ecological Vegetation Cover C 4 trees are found in areas with lower vegetation cover than C 3 trees Pau et al. (2013), Still et al. (2014) a Supporting references are studies showing environmental distributions for C 3 and C 4 herbaceous species.
To distinguish the role that evolutionary history may have played in photosynthetic type sorting across these six environmental variables, phylogenetic generalized least squares (PGLS) analyses were performed using the pgls function in the caper package in R (Orme et al., 2018). A comparative dataset was assembled from species mean data for each of the seven environmental variables used in the PCA, the first three principal components (PCs) of the PCA, and the species phylogeny. This dataset included a variance-covariance (VCV) matrix that represents species' phylogenetic relationships to one another. A PGLS model was then fitted to each of the seven environmental variables and three PCs in the dataset with photosynthetic type as the categorical predictor. Pagel's lambda (λ) was calculated to estimate the strength of the phylogenetic signal in the mean response of each species on a scale of zero (no phylogenetic dependence) to one (perfect phylogenetic dependence). ANOVA was then performed on each model.

Geographical and environmental distributions of C 3 and C 4 trees overlap
C 3 , C 4 , and CAM Euphorbiaceae trees have largely overlapping geographical distributions across the Hawaiian Islands (Fig. 1A). Based on occurrence data available in the BIEN and GBIF databases, both C 3 and C 4 trees in Euphorbiaceae occur on the islands of Kaua`i, O`ahu, Lana`i, Maui, and Hawai`i. Only a single Euphorb tree species, Euphorbia celastroides, occurs on Molokaa`i. In fact, E. celastroides is the only C 4 tree species recorded on each of Molokaa`i, Lana`i, and Maui, and has the most widespread distribution of any of the C 4 trees. Only one island-Ni`ihau-has no record of Euphorb trees. There are also three CAM tree species in Euphorbiaceae on the Hawaiian Islands (Jatropha curcas, Euphorbia lactea, and E. tirucalli), which occur on Maui, O`ahu, and Hawai`i (Fig. 1A). Trees in Euphorbiaceae occur across a broad range of elevations across the Islands, from -5 m to 3212 m (Fig. 1B). C 3 trees occupy this entire range of elevations, but are largely skewed toward the lower elevations, with a median value of 170 m. C 4 trees are found at elevations of 6 m to 3212 m, and are more evenly distributed across this range than their C 3 counterparts, with a median elevation of 526 m.
Overall, trees in Euphorbiaceae are broadly distributed across precipitation regimes that span the range found across the Hawaiian Islands ( Fig. 2A, B). C 3 trees occur in areas with as little as 232 mm mean annual precipitation (MAP) on the northwest coast of Hawai`i and as high as 9010 mm MAP in Kaua`i. Similarly, C 4 trees have been recorded on the south slopes of Mauna Kea in Hawai`i with only 356 mm MAP, but also in Kaua`i with MAP up to 9010 mm. The distribution of Euphorbiaceae trees across temperatures was narrower, but also reflected that of the Hawaiian Islands ( Fig. 2A, B). Mean annual temperature (MAT) ranged from 6.6 to 23.8 °C, with the majority of trees occurring within a much narrower range of 18-24 °C. While C 3 and C 4 trees inhabited similar temperature spaces, C 3 trees were more skewed towards the upper end of the temperature range compared with C 4 trees. According to the Whittaker Biomes theory (Whittaker, 1975), these MAP and MAT ranges suggest that Hawaiian Euphorb trees are most commonly found within tropical rainforest and tropical seasonal forest/savanna biomes. C 3 and C 4 trees are similarly distributed across these biomes ( Fig. 2A, B).

C 4 photosynthesis expands the ecological range of trees in Euphorbiaceae on the Hawaiian Islands
We performed three PCAs to determine which environmental and soil variables most strongly distinguish the distributions of C 3 and C 4 individuals on the Hawaiian Islands. The first PCA incorporated the seven environmental explanatory variables that we hypothesized would differ in C 3 and C 4 trees based on the published literature on monocots and herbaceous eudicots (Table 2). This generated three PCs which captured nearly 85% (39.88% PC1, 31.22% PC2, and 13.31% PC3) of the variation across the dataset ( Fig. 2C; Supplementary Fig. S8). Wettest quarter temperature, minimum coldest month temperature, vegetation cover, and solar radiation were most strongly correlated with PC1, while precipitation seasonality, precipitation of the driest month, and CWD were most strongly correlated with PC2 ( Fig.  2C; Supplementary Table S3). The two temperature variables showed strong positive correlation with each other in the PCA space, while the pair of precipitation variables showed strong negative correlation with each other, as did vegetation cover and solar radiation. There is a large degree of overlap between the C 3 and C 4 groups in the PCA space, with the 95% confidence ellipse for the C 3 group almost completely contained within that of the C 4 group. The C 3 and C 4 groups occupied a similar range of values on PC2 but the C 4 group had a much broader distribution than the C 3 group on PC1 with a skew towards lower PC1 values. This indicates that C 4 trees occupy a broader range of ecological niches than C 3 trees, and that environments with low temperature and vegetation cover, and high levels of solar radiation are more often occupied by C 4 trees than their C 3 counterparts.

(B) (A)
To determine whether the difference in ecological space between C 3 and C 4 trees was larger or smaller than that of closely related herbaceous and shrub life forms, we performed a second PCA on the same seven environmental variables but including all growth forms (herb, shrub, and tree) (Supplementary Figs S5, S6). PC1 accounted for 36.18% of the variation in the dataset, while PC2 accounted for 34.23% (Supplementary Fig.  S5). The results of this analysis show that, similarly to trees, C 4 shrubs occupy a broader range of ecological niches than C 3 shrubs ( Supplementary Fig. S5C). The niche-broadening effect is even more pronounced in shrubs than in trees, and is driven by all of the seven environmental variables. As such, when considered together, C 4 woody species (i.e. shrubs and trees) generally occupy a greater ecological niche than their C 3 counterparts ( Supplementary Fig. S5E). Conversely, C 3 and C 4 non-woody species (i.e. herbs) in this family have a smaller degree of ecological difference than shrubs and trees (shown by their more similar distributions in the PCA space), but C 4 herbs have a slightly broader range of values for PC2 (and to a lesser extent PC1) compared with C 3 herbs ( Supplementary  Fig S5B). PC2 is most strongly correlated with solar radiation, vegetation cover, and precipitation of the driest month (Supplementary Table S4), so this suggests that C 4 herbs occupy brighter, more open, drier environments than their C 3 counterparts.
The third PCA incorporated all the continuous soil variables available from the Hawaii Soil Atlas for C 3 and C 4 tree species (Supplementary Fig. S7). PC1 accounted for 35.7% of the variation in the dataset, while PC2 accounted for 29.95%. The variables of OM, K sat , and pH were most strongly correlated with PC1, while CEC was most strongly correlated with PC2 (Supplementary Table S5). There was almost complete overlap of the 95% confidence ellipses for the C 3 and C 4 groups in the PCA space ( Supplementary Fig. S6), indicating that soil variables did not differ greatly between the C 3 and C 4 trees in this dataset.

C 4 trees expanded their ecological range more than C 3 trees as they dispersed across the Islands
We used Mantel permutation tests to examine the association between ecological and geographical expansion in C 3 and C 4 trees. The results of the Mantel tests show a positive correlation between ecological and geographical expansion for both C 3 (P=0.019) and C 4 (P<0.001) trees. This was confirmed by linear regression analysis, which showed that the regression line for the C 4 group had a slightly steeper slope (0.011) than the C 3 group (0.0097), indicating that C 4 trees expanded their ecological niche more for a given degree of geographic dispersal compared with C 3 trees (Fig. 3). This is possibly associated with a small number of C 4 trees accessing higher elevation environments, where they are disproportionately abundant relative to C 3 trees (Fig. 1B).

Evolutionary history does not strongly influence C 3 and C 4 tree distributions
We generated a phylogeny of Euphorbiaceae trees found on the Hawaiian Islands using previously published sequence data (Fig. 4). Our phylogeny showed that C 4 tree species on the Hawaiian Islands form a monophyletic group, consistent with previous phylogenies [Fig. 4;Yang et al., 2018 (posterior probability=1, bootstrap=100)]. This is expected given that the Hawaiian Chamaesyce radiation was previously reported as being from a single common ancestor. We used the phylogeny to perform PGLS analyses on each environmental variable and the first three PCs of the first PCA (which included only tree species) to support the findings of the PCA and determine the role of evolutionary history in C 3 and C 4 tree environmental distributions. These analyses showed no influence of evolutionary history in six of the seven environmental variables and two of the three PCs, and that none of the environmental variables or PCs showed significant differences between photosynthetic types after accounting for any phylogenetic signal ( Table 3). The temperature variables did, however, show marginally significant differences between photosynthetic types. The PGLS showed evidence of strong phylogenetic signal in CWD and PC2 (λ=0.954 and λ=1.000, respectively); however, these variables did not show significant differences between photosynthetic type.

C 3 and C 4 trees have more similar geographical and environmental distributions than C 3 and C 4 herbaceous species
The results presented here suggest that C 3 and C 4 tree species across the Hawaiian Islands inhabit similar geographical and environmental spaces. There has been no significant niche shift after the single origin of the C 4 pathway in this clade, as indicated by the paucity of phylogenetic signal in the data. Where phylogenetic signal was present, in CWD and PC2, this was not associated with a significant difference between the two photosynthetic types. This suggests that this phylogenetic structuring is present across the whole family and is not driven by a niche shift associated with the evolution of C 4 photosynthesis.
The similar ecological distributions of C 3 and C 4 tree species contrasts with previous studies on herbaceous species from other families and geographical regions, which show clear differences between the ecological niches of C 3 and C 4 grasses and forbs (Teeri and Stowe, 1976;Stowe and Teeri, 1978;Pyankov et al., 2010;Bremond et al., 2012;Pau et al., 2013). Given that all true angiosperm trees are eudicots, and, in C 3 versus C 4 eudicots, aridity is an important determinant of distribution (more so than temperature, which is the primary determinant for monocots), we hypothesized that aridity would be the primary environmental determinant distinguishing the Fig. 3. Comparison of geographical and ecological distances for C 3 and C 4 trees across the Hawaiian Islands. Ecological and geographical distances were obtained for pairs of C 3 (blue) and C 4 (red) individuals. Regression lines forced to the origin were identified for each of the C 3 and C 4 groups, and the slopes calculated. distributions of C 3 and C 4 trees. However, we found no difference in the driest month precipitation, precipitation seasonality, or CWD between C 3 and C 4 trees, despite highly variable precipitation levels across the Hawaiian Islands ( Fig. 2; Table 3). In fact, precipitation variables seemed to be the least important factors in determining C 4 tree distribution, with solar radiation, vegetation cover, and temperature driving differences in distributions between the C 3 and C 4 groups in the PCA space (Fig. 2). Soil variables were notably similar between the C 3 and C 4 groups; however, this is consistent with a previous study on the biogeographic controls on C 3 and C 4 grass distributions (Griffith et al., 2015).
There are many possible factors contributing to the similar environmental distributions of C 3 and C 4 trees. First, the height of trees relative to herbaceous species may play a role, as tall plant height may confer greater embolism vulnerability under drought (Olson et al., 2018), and this inherent limitation may occur regardless of photosynthetic type. However, the C 4 trees on the Hawaiian Islands have not been observed to exceed 10 m in height (Pearcy and Troughton, 1975;Sporck, 2011) so this effect is likely to be minor. Second, and more likely, the small geographical area of the Hawaiian Islands (~28 000 km 2 ) may also constrain the degree of environmental differences between C 3 and C 4 trees. Strong trends in global distribution patterns are observed for C 3 and C 4 grasses, where the geographical area in question is much larger: global grassland areas comprise ~52.5 million km 2 (Sage et al., 1999;White et al., 2000). However, previous studies of C 3 and C 4 grasses on the Hawaiian Islands have revealed significant differences in environmental distributions (Edwards and Still, 2008;Still et al., 2014), so while differences in ecological niche may be less pronounced in an island environment, they are not necessarily completely obscured. Third, the nature of the climate and soil datasets may mask some of the true variability in environmental niche. In the case of the Hawaiian climate datasets, the data are averaged over multiple years, and the Hawaiian soil atlas data are only available for a limited set of variables and a limited number (61% in this study) of occurrence data points. Notably, the Hawaiian soil atlas lacks data on soil nitrogen content, although soil nitrogen availability is related to some of the other variables included in this analysis such as CEC (which is linked to NH 4 + levels) and pH and organic matter content (which are linked to nitrate levels) (Kemmitt et al., 2006). Soil nitrogen availability to plants may also be influenced by temperature, slope, soil aeration, and soil water (Amer and Bartholomew, 1951;Stanford et al., 1975;Kong et al., 2019), all of which vary across the Islands. It is possible that soil nitrogen availability may affect the distribution of the different photosynthetic types in this study given that the C 4 pathway increases nitrogen use efficiency (Pearcy and Ehleringer, 1984), but this effect cannot be detected in our analyses. However, this is likely to be a limitation for most studies of this nature, many of which still show clear differences in the ecological niches of C 3 and C 4 herbaceous species. Finally, life history is probably also significant. Grasses can have both annual and perennial life history strategies, and it is annual grasses (particularly Fig. 4. Phylogeny of Hawaiian Euphorbiaceae trees. Evolutionary relationships between C 3 (blue, n=16), C 4 (red, n=5), and CAM (black, n=3) Hawaiian Euphorbiaceae trees are presented in a pruned maximum credibility tree inferred from the BEAST analysis of a dataset representing 112 species in Euphorbiaceae. The C 3 species Macaranga mappa is missing from this tree due to an absence of available published nucleotide data. The Hawaiian species Claoxylon sandwicense is represented by the congeneric species C. glandulosum. The Hawaiian species Croton guatemalensis is represented by the congeneric species C. insularis. C 4 annuals) which grow in the hottest environments with the most seasonal precipitation, whereas perennial grasses tend to grow in regions with lower, more seasonal precipitation (Liu et al., 2019). This indicates that C 4 photosynthesis may be more important for carbon accumulation in hot, dry environments for fast-lived grasses, but not as important in longer lived species such as trees.

C 4 trees occupy a broad ecological niche on highly heterogenous Islands
Although C 3 and C 4 trees occupy similar environments, the results of the PCA and linear regression analysis of the environmental versus geographical distances suggest that C 4 photosynthesis does have a niche-broadening effect across all growth forms in Euphorbiaceae, with this effect being the most pronounced in woody species (Figs 2, 3; Supplementary Fig. S5).
In particular, C 4 tree species have expanded their range into environments with characteristically sparse vegetation cover, higher sunlight, and cooler temperatures (Fig. 2C). Reduced vegetation cover, increased solar radiation, and cooler temperatures are all correlated with increasing elevation levels ( Supplementary Fig. S9). This suggests that C 4 trees have expanded their ecological niche relative to their C 3 counterparts as they moved into higher elevation environments. It is worth noting that this niche broadening is not an effect of different sample sizes as there are fewer C 4 than C 3 tree occurrence points, and broader ecological distributions of C 4 species are seen in all life forms in this study, despite differences in relative sample sizes of C 3 versus C 4 groups. Furthermore, a broadening of the environmental niche associated with C 4 photosynthesis is consistent with a previous intraspecific study in grasses (Lundgren et al., 2015). As such, it may be that C 4 photosynthesis does influence the niche of woody species on the Hawaiian Islands, but that there has not been sufficient opportunity for niche specialization among C 4 tree species to generate an apparent niche shift relative to their C 3 counterparts due to their restricted global range. As previously stated, the small geographic range of C 4 trees could have acted to limit the environmental conditions that they can access, even though this limitation is minimized by two factors.
First, the trees have dispersed across six of the Hawaiian Islands (Fig. 1), which gives them the opportunity to access the full range of the environments and climates that occur across the Islands (Fig. 2). This dispersal across the Islands has probably been facilitated by the seed characteristics of the ancestor of the C 4 Hawaiian Euphorbia, which had small seeds with mucilaginous seed coats that adhered to birds to facilitate dispersal (Price and Wagner, 2004). Some species of Hawaiian Euphorbia have then undergone habitat specialization following dispersal, Table 3. Results of phylogenetic generalized least squares (PGLS) analysis and ANOVA for the effects of photosynthetic type on environmental variables and principal components (PCs) for C 3 and C 4 trees on the Hawaiian Islands. such as the C 4 tree E. rockii, which is a single-island endemic on O`ahu (Yang et al., 2018). Euphorbia rockii has large, non-sticky seeds which may be beneficial for seedling survival in forest understorey habitats, but also have reduced dispersal (Koutnik, 1987;Jordan and Hayden, 1992;Yang et al., 2018). It is worth noting that there are some islands, such as Ni`ihau, that have no record of Euphorb species (Supplementary Fig. S3), which may be a result of limited sampling rather than true absence. Second, the environments of the Hawaiian Islands are highly heterogenous, which generates large environmental gradients over relatively small geographic distances (Barajas- Barbosa et al., 2020). This underlies the suitability of the Hawaiian Islands for this type of analysis: there is the potential for large ecological niche variation between species in close geographic proximity and, as such, there is a high degree of potential variation in climatic and environmental variables between individuals. Indeed, studies of the ecological differences between C 3 and C 4 grasses on the Islands have identified trends in ecological distributions that are consistent with globally observed patterns (Pau et al., 2013;Still et al., 2014).
Despite these mitigating factors, it is likely that geographical limitation still acts to constrain the magnitude of potential environmental differences between C 3 and C 4 trees. As the Hawaiian Islands formed, this generated a high availability and diversity of new niches, which probably facilitated the radiation of the C 4 Euphorbia (Yang et al., 2018). However, the small and isolated nature of the Hawaiian Islands means that the potential for niche shifts following this initial radiation is likely to be lower than that on a continental land mass. This is reflected in the limited environmental differences that are seen across all growth forms ( Supplementary Fig. S5) in Euphorbiaceae on the Hawaiian Islands. Indeed, herbaceous C 3 and C 4 Hawaiian Euphorbs seem to grow in more similar environments than has previously been observed for their counterparts in other geographical regions (Batanouny et al., 1991).

Euphorbiaceae is a morphologically and photosynthetically diverse plant family
In addition to the geographic limitations of the Hawaiian Islands, there are further complications in this study in conclusively determining growth form and photosynthetic type in Euphorbiaceae.
With respect to growth form, there is no unilaterally accepted definition of a tree: it is not a single phylogenetic grouping, but a life 'strategy' that can vary between and within species. For example, Euphorbia celastroides can achieve the tree growth form, but also has varieties with shrubbier, more prostrate growth forms (Sporck, 2011;Yang et al., 2018), and distinguishing between a large shrub and a small tree of the same species is subjective. These intraspecific differences are not resolved within the available occurrence data, so certain individuals identified as trees in this study may have had shrubbier growth forms. However, this issue is mitigated somewhat by the fact that the ecological differences between C 3 and C 4 shrubs and trees seem to be driven by similar factors, and the niche-broadening effect of C 4 photosynthesis is consistent across all woody species (Supplementary Fig. S5).
Furthermore, photosynthetic type has not been conclusively established for all the species in Euphorbiaceae, as determining photosynthetic type can require a combination of carbon isotope discrimination (δ 13 C), gas exchange, and leaf anatomy data. These data are not available in the literature for all species: for most species, only δ 13 C data could be obtained. As such, there may be unrecognized photosynthetic diversity within species currently classified as using C 3 photosynthesis. This is made more likely by the fact that Euphorbiaceae is a highly photosynthetically diverse plant family, known to include species using C 3 , C 4 , C 3 -C 4 intermediate, and CAM modes of photosynthesis, although there are currently no known C 3 -C 4 trees in Euphorbiaceae or otherwise.

Photosynthetic diversity in Euphorbiaceae provides insights into C 4 evolution in tree species
The reason for the global rarity of C 4 and C 3 -C 4 tree species is an ongoing question in plant ecophysiology and evolution. In order to answer this question, the two potential evolutionary routes to C 4 photosynthesis in trees must be assessed (Young et al., 2020).
First, C 4 photosynthesis may arise in an existing tree, via a C 3 -C 4 intermediate state. The feasibility of this evolutionary path cannot be confirmed due to the apparent absence of C 3 -C 4 trees from global plant biodiversity. Given the high degree of photosynthetic diversity in woody Euphorbs, the lack of C 3 -C 4 photosynthesis is particularly apparent. Indeed, in Euphorbia alone, there have been at least 17 independent evolutions of a photosynthetic CO 2 -concentrating mechanism; 16 evolutions of CAM and one of C 4 (Horn et al., 2014). The C 4 lineage was ancestrally herbaceous, while all 16 evolutions of CAM occurred in ancestrally woody lineages. In view of this, Horn et al. (2014) hypothesize that life history is important in establishing evolutionary trajectory. We hypothesize that biogeographical factors associated with life history traits may also affect the ability of a lineage to evolve C 4 photosynthesis; that is, a lower quantum yield in C 3 -C 4 photosynthetically intermediate species (i.e. potential C 3 -C 4 trees) could occlude them from shaded habitats (Monson et al., 1986;Monson and Moore, 1989). This could therefore select against the evolution of intermediate photosynthetic types, and thus C 4 photosynthesis, in ancestrally woody forest species, and provides some explanation for the absence of woody C 3 -C 4 species.
Second, a C 4 tree species may evolve from an herbaceous C 4 ancestor. This was the route taken by the Hawaiian Euphorbia, indicating that this is a feasible evolutionary path (Yang et al., 2018). However, if the transition to the tree state means that the potential benefits of the C 4 pathway are diminished or eliminated, this reduces the likelihood of C 4 trees emerging via this pathway and persisting in competitive environments, thus providing some explanation for the rarity of C 4 trees. The minimal ecological differences between C 3 and C 4 trees presented here offer some support for this explanation: there may be little to no benefit, or even a cost, to the C 4 photosynthetic pathway in trees in terms of carbon or water gain in hotter and/or drier climates. It is possible that C 4 photosynthesis has only persisted in trees on the Hawaiian Islands due to the high availability and diversity of new niches that emerged as the Islands formed (which has been shown to be consistent with a high level of species diversity; Sebastian et al., 2012;Barajas-Barbosa et al., 2020), rather than the C 4 photosynthetic pathway providing any benefit over the ancestral C 3 state. However, it is difficult to fully assess the validity of this conclusion given that the Hawaiian Euphorbia represent a single C 4 lineage (one of 34 in the eudicots), and the only C 4 lineage to include true trees (Sage, 2016).
If indeed it is the case that the C 4 pathway provides limited benefit in trees, then this not only helps to explain why C 4 photosynthesis is so rare in trees but also provides insight into the value (or lack thereof) of C 4 trees from an engineering perspective. If the C 4 pathway did allow trees to perform better under hot and/or dry environmental conditions, the engineering of C 4 photosynthesis into trees would be a valid target for futureproofing against climate change to secure timber yields or forest carbon sequestration. However, if the C 4 pathway does not influence tree ecology in this way, as the results presented here suggest, then this approach would not be beneficial. C 4 trees may only be exceptional with respect to their rarity, and not their performance relative to their C 3 counterparts.

Conclusions
Our results suggest that C 3 and C 4 trees in Euphorbiaceae inhabit similar environments across the Hawaiian Islands. This may, in part, be due to the limited geographic range of C 4 trees globally, but may also indicate that C 4 photosynthesis is not beneficial in trees in terms of accessing hotter and/or drier climates. However, C 4 photosynthesis does appear to have a niche-broadening effect in tree species, and C 4 trees have expanded their range on the Hawaiian Islands into environments with characteristically sparse vegetation cover, higher sunlight, and cooler temperatures. The high level of photosynthetic and morphological diversity in Euphorbiaceae merits further investigation to conclusively establish (i) how C 4 photosynthesis, in this rare case, evolved in trees; (ii) whether there has been evolution of intermediate photosynthetic types in woody species in this family; and (iii) whether these intermediate photosynthetic types may confer any benefit over the C 3 state.

Supplementary data
The following supplementary data are available at JXB online. Fig. S1. Map of occurrence data flagged by CoordinateCleaner. Fig. S2. Map of occurrence data by data source. Fig. S3. Map of distributions of Hawaiian herbs, shrubs, and trees by photosynthetic type. Fig. S4. Complete phylogeny of all species whose data were obtained in this study. Fig. S5. Principal components 1 and 2 of a principal component analysis (PCA) of environmental variables for all growth forms. Fig. S6. Principal components 3 and 4 of a principal component analysis (PCA) of environmental variables for all growth forms. Fig. S7. Principal component analysis (PCA) of soil variables for C 3 and C 4 trees. Fig. S8. Histogram of principal component 3 of a principal component analysis (PCA) of environmental variables for C 3 and C 4 trees. Fig. S9. Distribution of C 3 and C 4 trees by elevation and environmental variables correlated with elevation. Table S1. Issues identified by GBIF which resulted in a record being removed from this analysis Table S2. Summary of sequence data used and accession numbers. Table S3. Environmental variables correlated to PCA axes (trees only). Table S4. Environmental variables correlated to PCA axes (all life forms). Table S5. Soil variables correlated to PCA axes (trees only).